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DESCRIPTION 

DECODING METHOD, DECODING APPARATUS, AND PROGRAM 



Technical Field 

The present invention relates to a decoding method, a 
decoding apparatus, and a program. More particularly, the 
present invention relates to a decoding method and a 
decoding apparatus for decoding codes on which coding is 
performed using low density parity check codes, and to a 
program therefor. 



Background Art 

In recent years, research in, for example, 
communication fields such as mobile communication and deep 
space communication, and broadcasting fields such as 
terrestrial-wave or satellite digital broadcasts has 
progressed remarkably. Along with this situation, research 
on coding theories for making error correction coding and 
decoding efficient has been actively carried out. 

As a theoretical limit of code performance, the Shannon 
limit implied by the so-called Shannon's (C. E. Shannon) 
channel coding theorem is known. Research on coding 
theories has been carried out for the purpose of developing 
codes exhibiting performance near this Shannon limit. In 
recent years, as a coding method exhibiting performance near 
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the Shannon limit, for example, techniques for what is 
commonly called "turbo coding", such as parallel 
concatenated convolutional codes (PCCC) and serially 
concatenated convolutional codes (SCCC) , have been developed. 
5 Furthermore, whereas this turbo coding has been developed, 
low density parity check codes (hereinafter referred to as 
"LDPC codes"), which is a coding method that has been known 
for a long time, have attracted attention. 

LDPC codes were proposed first in R. G. Gallager, "Low 

10 Density Parity Check Codes", Cambridge, Massachusetts: M. I. 
T. Press, 1963. Thereafter, LDPC codes reattracted 
attention in D. J. C. MacKay, "Good error correcting codes 
based on very sparse matrices", submitted to IEEE Trans. Inf. 
Theory, IT-45, pp. 399-431, 1999, and M. G. Luby, M. 

15 Mitzenmacher , M. A. Shokrollahi and D. A. Spielman, 

"Analysis of low density codes and improved designs using 
irregular graphs", in Proceedings of ACM Symposium on Theory 
of Computing, pp. 249-258, 1998. 

It is beginning to be known from this recent research 

20 that, for the LDPC codes, as the code length increases, 
performance close to the Shannon limit can be obtained, 
similarly to turbo coding. Furthermore, since the LDPC 
codes have the property that the minimum length is 
proportional to the code length, they have the advantages 

25 that the block error probability characteristics are good, 
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and a so-called error floor phenomenon, which is observed in 
decoding characteristics of turbo coding, hardly occurs. 

Such LDPC codes will now be described in detail below. 
The LDPC codes are linear codes and do not always need to be 
5 two-dimensional, but here, a description is given assuming 
that the LDPC codes are two-dimensional. 

The greatest features of the LDPC codes are that the 
parity check matrix that defines the LDPC codes are sparse. 
Here, a sparse matrix is formed in such a manner that the 

10 number of Is in the elements of the matrix is very small. 

If the sparse check matrix is denoted as H, examples thereof 
include a check matrix in which, as shown in Fig. 1, the 
Hamming weight of each column (number of Is; weight) is "3", 
and the Hamming weight of each row is "6". 

15 As described above, the LDPC codes defined by the check 

matrix H in which the Hamming weight of each row and each 
column is fixed are called "regular LDPC codes" . On the 
other hand, the LDPC codes defined by a check matrix H in 
which the Hamming weight of each row and each column is not 

20 fixed are called "irregular LDPC codes". 

Coding by such LDPC codes is realized by generating a 
generation matrix G on the basis of the check matrix H and 
by generating a code word by multiplying this generation 
matrix G by a two-dimensional information message. More 

25 specifically, a coding apparatus for performing coding by 
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LDPC codes computes a generation matrix G in which the 
equation GH T = 0 holds with a transpose matrix H T of the 
check matrix H. Here, when the generation matrix G is a k x 
n matrix, the coding apparatus multiplies the generation 
5 matrix G by a k-bit information matrix (vector u) , and 
generates an n-bit codeword c (= uG) . The codeword 
generated by this coding apparatus is transmitted with the 
code bit whose value is "0" being mapped to 11 +1" and the 
code bit whose value is "1" being mapped to "1", and is 

10 received at the reception side via a predetermined 
communication channel . 

On the other hand, decoding of the LDPC codes can be 
performed by a message passing algorithm by belief 
propagation on a so-called Tanner graph, which is formed of 

15 a variable node (also called a message node) and a check 
node; this message passing algorithm was proposed by 
Gallager and is known as "probabilistic decoding" . 
Hereafter, the variable nodes and the check nodes are also 
referred to simply as nodes where appropriate. 

20 However, in probabilistic decoding, since messages 

exchanged between nodes are real -number values, in order to 
find an analytical solution, it is necessary to trace the 
probability distribution of the message that takes a 
continuous value. This necessitates analysis involving a 

25 large degree of difficulty. Accordingly, Gallager has 
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proposed an algorithm A or an algorithm B as an algorithm 
for decoding LDPC codes . 

In general, decoding of the LDPC codes is performed in 
accordance with the procedure shown in Fig. 2. Here, the 
5 receiving value (the receiving code sequence) is denoted as 
U 0 (uoi), the message output from the check node is denoted as 
u-j, and the message output from the variable node is denoted 
as vi. Here, the message is a real-number value such that 
the "0" -likeness of the value is represented by a so-called 

10 log likelihood ratio. 

In the decoding of the LDPC codes, initially, as shown 
in Fig. 2, in step Sll, the receiving value U 0 (u 0 i). is 
received, the message Uj is initialized to 0, and a variable 
k that takes an integer as a counter for an iterative 

15 process is initialized to 0. The process then proceeds to 
step S12 . In step S12, based on the received value U 0 (u 0 i), 
a message v ± is determined by performing a computation shown 
in equation (1) (computation of a variable node) . 
Furthermore, based on this message Vi, a message Uj is 

20 determined by performing a computation shown in equation (2) 
(computation of a check node) . 




1 




j = i 
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tanh 



(-y-) = Vtanh 




... (2) 



Here, d v and d c in equations (1) and (2) are parameters 
respectively that indicate the number of Is in the vertical 
direction (column) and in the horizontal direction (row) of 
5 the check matrix H and that can be selected as desired. For 
example, in the case of a (3, 6) code, d v = 3 and d c = 6 . 

In the computation of each of equations (1) and (2) , 
since the message input from an edge (line connecting the 
variable node and the check node) from which a message is to 

10 be output is not used as a parameter for a sum or product 

computation, the range of the sum or product computation is 
from 1 to d v - 1 or 1 to d c - 1 . In practice, the 
computation shown in equation (2) is performed by creating 
in advance a table of a function R(v l7 v 2 ) , shown in equation 

15 (3) , that is defined by one output with respect to two 
inputs v x and v 2 and by using this table continuously 
(recursively) , as shown in equation (4) . 



x=2tanh" 1 {tanh (v t /2) tanh (v 2 /2) } =R (v t , v 2 ) 



. . . (3) 



u j =R(v 1 , R(v 2 , R(v 3 , -"RCvd -2, v d -i)))) 



. . . (4) 



20 



In step S12, furthermore, the variable k is incremented 



by 1, and the process then proceeds to step S13 . In step 



S13, it is determined whether or not the variable k is 
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greater than a predetermined number N of iterative decodings . 

When it is determined in step S13 that the variable k is not 

greater than N, the process returns to step S12, where the 

identical processing is performed again. 

5 When it is determined in step S13 that the variable k 

is greater than N, the process proceeds to step S14, where 

the message v ± serving as the decoded result, which is 

finally output as a result of performing the computation 

shown in equation (5), is determined and output. This 

10 completes the decoding process of the LDPC codes. 
d v 

Vi=U 0i + Z Uj 

j=1 ... (5) 

Here, unlike the computation of equation (1) , the 
computation of equation (5) is performed using the input 
messages from all the edges connected to the variable nodes . 

15 In such LDPC code decoding, for example, in the case of 

(3, 6) code, as shown in Fig. 3, messages are exchanged 
between nodes. In the node (variable node) indicated by " = " 
in Fig. 3, the computation shown in equation (1) is 
performed. In the node indicated by " +" (check node) , the 

20 computation shown in equation (2) is performed. In 

particular, in the algorithm A, the message is formed to be 
two-dimensional; in the node indicated by 11 + ", an exclusive 
OR computation of d c - 1 input messages is performed; and in 
the node indicated by , with respect to the received 
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value R, when all the d v - 1 input messages are different bit 
values, the sign is inverted and output. 

Furthermore, in recent years, research on an 
implementation method of the decoding of LDPC codes has been 
5 carried out. Before describing the implementation method, 
the decoding of LDPC codes is described in a schematic form. 

Fig. 4 shows an example of a parity check matrix of 
(3,6) LDPC codes (a coding rate of 1/2, a code length of 12). 
The parity check matrix of LDPC codes can be written by 

10 using a Tanner graph, as shown in Fig. 5. In Fig. 5, nodes 
indicated by " + " are check nodes, and nodes indicated by " = " 
are variable nodes. The check nodes and the variable nodes 
correspond to the rows and the columns of the parity check 
matrix, respectively. The connecting line between the check 

15 node and the variable node is an edge and corresponds to "1" 
of the check matrix. That is, when the element of the j-th 
row and the i-th column of the check matrix is 1, in Fig. 5, 
the i-th variable node (node of " = M ) from the top and the j - 
th check node (node of "+") from the top are connected to 

2 0 each other by an edge. The edge indicates that the sign bit 
corresponding to the variable node has a constraint 
condition corresponding to the check node. Fig. 5 shows a 
Tanner graph of the check matrix of Fig. 4. 

In the sum product algorithm, which is a method of 

25 decoding LDPC codes, the computation of the variable node 
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10 



15 



and the computation of the check node are repeatedly 
performed . 

In the variable node, as shown in Fig. 6, the 
computation of equation (1) (variable node computation) is 
performed. That is, in Fig. 6, the message v ± corresponding 
to the edge to be calculated is calculated by using the 
messages u x and u 2 from the remaining edges connected to the 
variable node, and the received information u 0 ± . The 
messages corresponding to the other edges are also 
calculated similarly . 

Next, before describing the check node computation, 
equation (2) is rewritten as shown in equation (6) by using 
the equation a x b = exp{ln(|a|) + ln(|b|)} x sign (a) x 
sign(b), where sign(x) is 1 when x > 0 and is -1 when x < 0. 

Vj 



'd c -1 



Uj=2tanh~ 1 | TT tanh^ 
=2tanh~ 1 
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d c -1 



x TT sign(Vi) 
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(6) 



Furthermore, in the case of x > 0, when the definition 
cp(x) = In (tanh (x/2) ) is made, since cp" 1 (x) = 2tanh~ 1 (e" x ) , 
equation (6) can be written as equation (7) . 
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/do-i \ d c -i 

Uj = 0-M I 0(|vi|) x TT sign(Vj) 

V i=1 J 1=1 ...(7) 

In the check node, as shown in Fig. 7, the computation 
of equation (7) (check node computation) is performed. That 
is, in Fig. 7, the message Uj corresponding to the edge for 
5 which a calculation is to be performed is calculated by- 
using the messages v X/ v 2 , v 3 , v 4 , and v 5 from the remaining 
edges connected to the check node. The messages 
corresponding to the other edges are also calculated 
similarly. 

10 The function cp(x) can also be expressed as <p(x) = 

In ( (e x +l) / (e x -l) ) , and when x > 0, cp(x) = (p _1 (x). When the 
functions (p(x) and cp" 1 (x) are implemented as hardware, there 
are cases in which they are implemented using an LUT (Look- 
up Table) , and both of them are the same LUT. 

15 When the sum product algorithm is implemented as 

hardware, it is necessary to repeatedly perform the variable 
node computation expressed by equation (1) and the check 
node computation expressed by equation (7) with an 
appropriate circuit scale and at an appropriate operating 

2 0 frequency. 

As an example of the implementation of the decoding 
apparatus, a description is given first of an implementation 
method in a case where decoding is performed by simply 
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performing the computation of each node one-by-one in 
sequence (full serial decoding) . 

It is assumed here that, for example, codes (a coding 
rate of 2/3, and a code length of 90) represented by a 30 
5 (rows) x 9 0 (columns) check matrix of Fig. 8 are decoded. 
The number of Is of the check matrix of Fig. 8 is 269; 
therefore, in the Tanner graph, the number of edges becomes 
269. Here, in the check matrix of Fig. 8 (the same applies 
to Figs. 15 to 17 (to be described later)), 0 is represented 
10 by 

Fig. 9 shows an example of the configuration of a 
decoding apparatus for decoding LDPC codes once. 

In the decoding apparatus of Fig. 9, a message 
corresponding to one edge is calculated for each clock at 

15 which it operates. 

More specifically, the decoding apparatus of Fig. 9 
includes one memory 104 for reception, two memories 10 0 and 
102 for edges, one check node calculator 101, and one 
variable node calculator 103 . 

20 In the decoding apparatus of Fig. 9, message data is 

read one-by-one from the memory 100 or 102 for edges, and by 
using the message data, the message data corresponding to 
the desired edge is calculated. Then, the message data 
determined by that calculation is stored one-by-one in the 

25 memory 100 or 102 for edges at a subsequent stage. When 
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iterative decoding is to be performed, the iterative 
decoding is realized by serially concatenating a plurality 
of the decoding apparatuses of Fig. 9 for decoding LDPC 
codes once or by repeatedly using the decoding apparatus of 
5 Fig. 9. Here, it is assumed that, for example, a plurality 
of the decoding apparatuses of Fig. 9 are connected. 

The memory 10 0 for edges stores output messages D100 
supplied from the variable node calculator 103 of the 
decoding apparatus (not shown) at a previous stage in the 

10 order in which the check node calculator 101 at a subsequent 
stage reads them. Then, at the phase of the check node 
calculation, the memory 100 for edges supplies, to the check 
node calculator 101, the messages D100 as a message output 
D101 in the order in which they are stored. By using the 

15 message D101 supplied from the memory 100 for edges, the 
check node calculator 101 performs a computation in 
accordance with equation (7) and supplies a message D102 
determined by that computation to the memory 102 for edges 
at a subsequent stage . 

20 Fig. 10 shows an example of the configuration of the 

check node calculator 101 of Fig. 9 for performing check 
node calculations one-by-one. 

In the check node calculator 101 of Fig. 10, messages vi 
from the variable node corresponding to each column of the 

25 check matrix, which are supplied from the memory 100 for 
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edges, are read one-by-one, and the computation of <p(|vi|) in 
equation(7) is performed using an LUT . Furthermore, cp ( | v ± | ) 
determined from the message vi from the variable node 
corresponding to each column over one row of the check 
5 matrix is integrated. As a result, the integrated value of 
<p(|vi|) determined from the messages vi from all the edges is 
determined. Thereafter, <p(|vi|) which was determined from 
the edge for which the message Uj was desired to be 
determined and which was delayed at the FIFO (FIFO memory) 

10 is subtracted from the integrated value. As a result, for 

the edge for which the message Uj is desired to be determined, 
2<p(|vi|) in equation (7) is determined. That is, by 
subtracting the message from the edge for which the message 
Uj is desired to be determined from the sum of the messages 

15 from all the edges to the check node, the message to the 

edge for which the message Uj is desired to be determined is 
determined. Furthermore, based on the LUT, the computation 
of cp" 1 (Zcp ( | vi | ) ) in equation (7) is performed. At the same 
time, the sign bit of the message Uj , that is, Flsign(vi) in 

20 equation (7) , is also calculated similarly by using an EXOR 
circuit. In the manner described above, the computation of 
equation (7) is performed, and the message Uj is determined. 

In Fig. 10, the check node calculator 101 is shown by 
assuming that each message, together with the sign bit, is 

25 quantized into a total of 6 bits. Since the maximum row 
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weight of the check matrix to be processed here in Fig. 8 is 
9, that is, since the maximum number of messages supplied to 
the check node is 9, the check node calculator 101 has FIFOs 
(First In First Out) for delaying nine messages (<p(|vi|)). 
5 Referring back to Fig. 9, the memory 10 for edges 

stores the messages D102 supplied from the check node 
calculator 101 at a previous stage in the order in which the 
variable node calculator 103 at a subsequent stage reads 
them. Then, at the phase of the variable node calculation, 

10 the memory 10 for edges supplies the message output D102 as 
a message output D103 to the variable node calculator 103 in 
the order in which they are stored. 

The variable node calculator 103 performs a computation 
in accordance with equation (1) by using the message output 

15 D103 supplied from the memory 10 for edges and received data 
(received value of the LDPC code) D104 supplied from the 
memory 104 for reception, and supplies a message D105 
obtained as a result of the computation to the memory 100 
for edges of the decoding apparatus (not shown) at a 

2 0 subsequent stage. 

Fig. 11 shows an example of the configuration of the 
variable node calculator 103 of Fig. 9 for performing 
variable node calculations one-by-one. 

In the variable node calculator 103 of Fig. 11, 

2 5 messages Uj from the check node corresponding to each row of 
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the check matrix, which are supplied from the memory 102 for 
edges, are read one -by-one , and the messages from the check 
node corresponding to each row over one column of the check 
matrix are integrated to determine the integrated value. 
5 Thereafter, the message which was supplied from the edge for 
which the message vi is desired to be determined and which 
was delayed at the FIFO is subtracted from the integrated 
value. Furthermore, by adding the received value u 0 ±, the 
computation of equation (1) is performed on the basis of the 

10 subtracted value obtained thereby. As a result, the message 
vi is determined. That is, by subtracting the message from 
the edge for which the message vi is desired to be determined 
from the sum of the messages from all the edges to the 
variable node, the message to the edge for which the message 

15 vi is desired to be determined is determined. 

Also, in Fig. 11, similarly to the case in Fig. 10, the 
variable node calculator 103 is shown by assuming that each 
message, together with the sign bit, is quantized into a 
total of six bits. Furthermore, in the check matrix to be 

2 0 processed here in Fig. 8, since the maximum value of the 
column weight is 5, the variable node calculator 103 has 
five FIFOs for delaying a message. When a message of the 
column whose weight is less than 5 is calculated, the amount 
of delay at the FIFO is reduced to the value of the weight 

2 5 of the column. 
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Referring back to Fig. 9 again, a control signal (not 
shown) is supplied to the decoding apparatus in accordance 
with the weight of the check matrix. According to the 
decoding apparatus of Fig. 9, if only the capacities of the 
5 memories for edges 100 and 102 and the FIFOs of the check 

node calculator 101 and the variable node calculator 103 are 
sufficient, various codes can be decoded by changing only 
the control signal. 

Although not shown, in the decoding apparatus of Fig. 9, 

10 at the final stage of the decoding, instead of the variable 
node calculation of equation (1) , the computation of 
equation (5) is performed, and the computation result is 
output as the final decoded result . 

When LDPC codes are decoded by repeatedly using the 

15 decoding apparatus of Fig. 9, the check node computation and 
the variable node computation are alternately performed. 
Therefore, for one decoding using the check matrix having 
269 edges, of Fig. 8, 269 x 2 = 538 clocks are required. 
Therefore, for example, in order to perform 50 iterative 

20 decodings, 538 x 50 = 26900 clock operations are necessary 

while 90 pieces of code information (received values), which 
is the code length, are received, and thus, a high-speed 
operation approximately 300 (= 26900/90) times as high as 
the receiving frequency becomes necessary. In this case, if 

25 the receiving frequency is assumed to be several tens of MHz, 
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operation at a speed of GHz or higher is required, and the 
implementation is not easy. 

Furthermore, in a case where, for example, 5 0 decoding 
apparatuses of Fig. 9 are concatenated to decode LDPC codes, 
5 a plurality of variable node calculations and check node 

calculations can be performed simultaneously. For example, 
while the first frame is performing a variable node 
computation, the second frame performs a check node 
computation and the third frame performs a variable node 

10 computation at a previous stage. In this case, while 90 

pieces of code information are received, 269 edges need to 
be calculated, the decoding apparatus needs to operate at a 
frequency approximately 3 (= 269/90) times as high as the 
receiving frequency, and thus realization is sufficiently 

15 possible. However, in this case, the circuit scale becomes, 
in simple terms, 50 times as large as the decoding apparatus 
of Fig. 9. 

Next, a description is given of the implementation 
method of the decoding apparatus in a case where decoding is 
20 performed by simultaneously performing computations of all 
the nodes (full parallel decoding) . 

This implementation method is described in, for example, 
C. Howland and A. Blanksby, "Parallel Decoding Architectures 
for Low Density Parity Check Codes", Symposium on Circuits 
25 and Systems, 2001. 
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Figs. 12A to 12C show the configuration of examples of 
the decoding apparatus for decoding the code (a coding rate 
of 2/3, and a code length of 90) represented by the check 
matrix of Fig. 8. Fig. 12A shows the overall configuration 
5 of the decoding apparatus. Fig. 12B shows the detailed 

configuration of the upper portion in the figure surrounded 
by the dotted line B, of the decoding apparatus of Fig. 12A. 
Fig. 12C shows the detailed configuration of the lower 
portion in the figure surrounded by the dotted line C, of 

10 the decoding apparatus of Fig. 12A. 

The decoding apparatus of Figs. 12A to 12C includes one 
memory 2 05 for reception, two edge interchange devices 2 00 
and 203, two memories 202 and 206 for edges, a check node 
calculator 201 made up of 30 check node calculators 201i to 

15 201 30 , and a variable node calculator 204 made up of 90 
variable node calculator 204x to 204 90 . 

In the decoding apparatus of Figs. 12A to 12C, all the 
message data corresponding to 26 9 edges is read 
simultaneously from the memory 202 or 206 for edges, and by 

20 using the message data, new message data corresponding to 
the 269 edges is computed. Then, all the new message data 
determined as a result of the computation is simultaneously 
stored in the memory 206 or 202 for edges at a subsequent 
stage. By repeatedly using the decoding apparatus of Figs. 

25 12A to 12C, iterative decoding is realized. Each section 
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will now be described below in detail . 

The memory 206 for edges simultaneously stores all the 
output messages D206i to D206 90 from the variable node 
calculators 204i to 204 90 at a previous stage, reads the 
5 messages D206i to D206 90 as messages D207i to D207 90 at the 

next clock (the timing of the next clock) , and supplies them 
as messages D200 (D200i to D200 90 ) to the edge interchange 
device 200 at the subsequent stage. The edge interchange 
device 200 rearranges (interchanges) the order of the 

10 messages D200i to D200 90 supplied from the memory 206 for 
edges in accordance with the check matrix of Fig. 8, and 
supplies necessary messages D201i to D201 30 to the check node 
calculators 201i to 201 30/ respectively. 

The check node calculators 201i to 201 30 perform a 

15 computation in accordance with equation (7) by using the 

messages D201 x to D201 30 supplied from the edge interchange 
device 200, and supplies the messages D202i to D202 30 
obtained as a result of the computation to the memory 202 
for edges . 

20 Fig. 13 shows an example of the configuration of a 

check node calculator 201 m (m = 1, 2,..., 30) of Figs. 12A to 
12C for simultaneously performing check node calculations. 

In the check node calculator 201 m of Fig. 13, similarly 
to the check node calculator 101 of Fig. 10, the check node 

25 computation of equation (7) is performed, and the check node 
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calculations are simultaneously performed for all the edges. 

More specifically, in the check node calculator 201 m of 
Fig. 13, all the messages from the variable node 
corresponding to each column of the check matrix of Fig. 8, 
5 which are supplied from the edge interchange device 200, are 
read simultaneously, and the computation of cp(|v ± |) in 
equation (7) is performed using the LUT. Furthermore, 
(p(| v i|) determined from the message v± from the variable node 
corresponding to each column over one row of the check 

10 matrix is integrated. This enables the integrated value of 
<P(| v i|) determined from the message v± from all the edges to 
be determined. Thereafter, cp ( | v ± | ) determined from the edge 
for which the message Uj is desired to be determined is 
subtracted from the integrated value. As a result, for the 

15 edge for which the message Uj is desired to be determined, 
Scp(| v i|) in equation (7) is determined. That is, by 
subtracting the message from the edge for which the message 
Uj is desired to be determined from the sum of the messages 
from all the edges to the check node, the message to the 

20 edge for which the message Uj is desired to be determined is 
determined. Furthermore, using the LUT, the computation of 
cp" 1 (Sep ( | vi | ) ) in equation (7) is performed. At the same time, 
the sign bit of the message Uj , that is, risign(Vi) in 
equation (7) , is also calculated similarly by using an EXOR 

25 circuit. In the manner described above, the computation of 
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equation (7) is performed, and the message Uj is determined. 

In Fig. 13, by assuming that each message, together 
with the sign bit, is quantized to a total of 6 bits, the 
check node calculator 201 m is shown. The circuit of Fig. 13 
5 corresponds to one check node . For the check matrix to be 
processed here in Fig. 8, since check nodes of 3 0 rows, 
which is the number of the rows thereof, exist, the decoding 
apparatus of Figs. 12A to 12C has 3 0 check node calculators 
201 m shown in Fig. 13. 

10 In the check node calculator 201 m of Fig. 13, nine 

messages can be calculated simultaneously. For the row 
weight of the check matrix to be processed here in Fig. 8, 
the weight of the first row is 8 and the weight of the 
second row is 9, that is, there is one case in which the 

15 number of messages supplied to the check node is 8 and there 
are nine cases in which the number of messages is 9. 
Therefore, the check node calculator 201i has a circuit 
configuration capable of simultaneously calculating eight 
messages similarly to the circuit of Fig. 13, and the check 

20 node calculators 201 2 to 201 30 are configured the same as the 
circuit of Fig. 13. 

Referring back to Figs. 12A to 12C, the memory 202 for 
edges simultaneously stores all the output messages D202i to 
D202 3 o supplied from the check node calculators 201i to 201 3 o 

25 at the previous stage, and at the next time, supplies all 
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the messages D202 x to D202 30/ as the output messages D203i to 
D2 03 30 , to the edge interchange device 2 03 at the subsequent 
stage . 

The edge interchange device 2 03 rearranges the order of 
5 the messages D203i to D203 3 o supplied from the memory 202 for 
edges in accordance with the check matrix of Fig. 8, and 
supplies necessary messages D204i to D204 90 to the variable 
node calculators 204 x to 204 90 , respectively. 

The variable node calculators 204x to 204 90 perform a 
10 computation in accordance with equation (1) by using the 

messages D204;l to D204 90 supplied from the edge interchange 
device 203 and the received data (received values) D205i to 
D205 90 supplied from the memory 205 for reception, and 
supplies messages D206i to D206 90 obtained as a result of the 
15 computation to the memory 2 06 for edges at the subsequent 
stage . 

Fig. 14 shows an example of the configuration of a 
variable node calculator 204 p (p = 1, 2,..., 90) of Figs. 12A 
to 12C for simultaneously performing variable node 
20 calculations. 

In the variable node calculators 204 p of Fig. 14, 
similarly to the variable node calculator 103 of Fig. 11, 
the check node calculation of equation (7) is performed, and 
the check node calculations are simultaneously performed for 
25 all the edges. 
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More specifically, in the variable node calculators 204 p 
of Fig. 14, all the messages Uj from the check node 
corresponding to each row of the check matrix, which are 
supplied from the edge interchange device 203, are read 
5 simultaneously, the message from the check node 

corresponding to each row over one row of the check matrix 
is integrated, and the integrated value is determined. 
Thereafter, the computation of equation (1) is performed by 
subtracting the message supplied from the edge for which the 

10 message v± is desired to be determined from the integrated 

value and by adding the received value u 0 ± to the subtracted 
value obtained thereby. As a result, the message vi is 
determined. That is, by subtracting the message for which 
the message v ± is desired to be determined from the sum of 

15 the messages from all the edges to the variable node, the 

message to the edge for which the message vi is desired to be 
determined is determined. 

In Fig. 14, by assuming that each message, together 
with the sign bit, is quantized to six bits, the variable 

20 node calculators 204 p is shown. The circuit of Fig. 14 

corresponds to one variable node . For the check matrix to 
be processed here in Fig. 8, since variable nodes of 90 
columns, which is the number of the columns thereof, exist, 
the decoding apparatus of Figs. 12A to 12C has 90 variable 

25 node calculators 204 p shown in Fig. 14. 
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In the variable node calculators 204 p of Fig, 14, it is 
possible to simultaneously calculate five messages. The 
check matrix to be processed here in Fig. 8 have 15, 45, 29, 
and 1 columns having weights of 5, 3, 2, and 1, respectively. 
5 Therefore, 15 variable node calculators out of the variable 
node calculators 204i to 204 9 o have the same circuit 
configuration as that of the circuit of Fig. 14. The 
remaining 45, 29, and 1 variable node calculators have the 
circuit configuration capable of simultaneously calculating 

10 3, 2, and 1 messages similarly to the circuit of Fig. 14. 

Although not shown, also, in the decoding apparatus of 
Figs. 12A to 12C, similarly to the case of Fig. 9, at the 
final stage of the decoding, instead of the variable node 
calculation of equation (1) , the computation of equation (5) 

15 is performed, and the computation result is output as the 
decoded result. 

According to the decoding apparatus of Figs. 12A to 12C, 
it is possible to simultaneously calculate all the messages 
corresponding to 26 9 edges at one clock. 

20 When decoding is performed by repeatedly using the 

decoding apparatus of Figs. 12A to 12C, the check node 
computation and the variable node computation are 
alternately performed, and one decoding can be performed at 
two clocks. Therefore, for example, in order to perform 50 

25 decodings, while 90 pieces of code information are received, 
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the decoding apparatus needs to operate at 2 x 50 = 100 
clocks, and thus, approximately the same operating frequency 
as the receiving frequency may be used. In general, since 
the code length of the LDPC codes is as great as several 
5 thousands to several tens of thousands, if the decoding 
apparatus of Figs. 12A to 12C is used, the number of 
decodings can be greatly increased, and the improvement in 
the error correction performance can be expected. 

However, in the decoding apparatus of Figs. 12A to 12C, 

10 since computations of messages corresponding to all the 
edges of a Tanner graph are performed in parallel, the 
circuit scale increases in proportion to the code length. 
When the decoding apparatus of Figs. 12A to 12C is 
configured as an apparatus for performing the decoding of 

15 LDPC codes having a particular check matrix of a particular 
code length and a particular coding rate, it is difficult 
for the decoding apparatus to perform the decoding of LDPC 
codes having another check matrix of another code length and 
another coding rate. That is, unlike the decoding apparatus 

20 of Fig. 9, it is not possible for the decoding apparatus of 
Figs. 12A to 12C to decode various codes even if the control 
signal is changed only, and the dependence on codes is high. 

In addition to the decoding apparatus of Fig. 9 and 
Figs. 12A to 12C, the implementation method for 

25 simultaneously calculating messages in units of four 
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messages rather than one message or all messages is 
described in, for example, E. Yeo, P. Pakzad, B. Nikolic and 
V. Anantharam, "VLSI Architectures for Iterative Decoders in 
Magnetic Recording Channels", IEEE Transactions on Magnetics, 
5 Vol. 37, No. 2, March 2001. In this case, there are 
problems in that, generally, it is not easy to avoid 
simultaneous read-out from or simultaneous writing to 
different addresses of the memory, and memory access control 
is difficult. 

10 Furthermore, a method of implementation by 

approximating the sum product algorithm has also been 
proposed. However, in this method, the deterioration of 
performance is caused to occur. 

Fig. 15 shows an example of a check matrix of LDPC 

15 codes of a code length of 90 and a coding rate of 2/3. When 
the implementation of the decoding apparatus for decoding 
LDPC codes, represented by this check matrix, is considered, 
the design itself of the decoding apparatus for calculating 
messages corresponding to the edge one -by-one or the 

20 decoding apparatus for simultaneously calculating all the 
messages corresponding to the edge is not difficult. 

However, the realization of the decoding apparatus is 
not easy from the aspect of the circuit scale and the 
operat ion speed . 

25 When the codes represented by the check matrix of Fig. 
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15 are decoded by using the decoding apparatus for 
simultaneously calculating particular p edges, in the memory 
in which edge data (messages corresponding to the edge) is 
stored, access of read-out from or access of writing to a 
5 position (address) different for each row or column is 

necessary. Therefore, different FIFOs need to be used for 
each row or column. Furthermore, for the message, there are 
cases in which the order in which it is calculated in the 
check node computation and the order in which it is used in 
10 the next variable node computation are interchanged, and 
thus it is not easy to realize the memory for storing 
messages simply by an FIFO. 

Disclosure of Invention 

15 The present invention has been made in view of such 

circumstances. An object of the present invention is to 
suppress the operating frequency to a sufficiently 
realizable range while suppressing the circuit scale for 
both logic and memory, and to be capable of easily 

20 controlling memory access. 

The decoding method of the present invention includes a 
decoding step of decoding LDPC codes by using a 
transformation check matrix obtained by performing one of or 
both a row permutation and a column permutation on an 

25 original check matrix. 
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The decoding apparatus of the present invention 
includes decoding means for decoding LDPC codes by using a 
transformation check matrix obtained by performing one of or 
both a row permutation and a column permutation on an 
5 original check matrix. 

The program of the present invention includes a 
decoding step of decoding LDPC codes by using a 
transformation check matrix obtained by performing one of or 
both a row permutation and a column permutation on an 
10 original check matrix. 

In the present invention, LDPC codes are decoded by 
using a transformation check matrix obtained by performing 
one of or both a row permutation and a column permutation on 
an original check matrix. 

15 

Brief Description of the Drawings 

Fig. 1 illustrates a check matrix H of LDPC codes. 
Fig. 2 is a flowchart illustrating a procedure for 
decoding LDPC codes. 
20 Fig. 3 illustrates the flow of messages. 

Fig. 4 shows an example of a check matrix of LDPC codes. 
Fig. 5 shows a Tanner graph of the check matrix. 
Fig. 6 shows a variable node. 
Fig. 7 shows a check node. 
2 5 Fig. 8 shows an example of a check matrix of LDPC codes. 
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Fig. 9 is a block diagram showing an example of the 
configuration of an LDPC code decoding apparatus for 
performing node computations one-by-one. 

Fig. 10 is a block diagram showing an example of the 
5 configuration of a check node calculator for calculating 
messages one-by-one . 

Fig. 11 is a block diagram showing an example of the 
configuration of a variable node calculator for calculating 
messages one-by-one . 
10 Fig. 12A is a block diagram showing an example of the 

configuration of an LDPC code decoding apparatus for 
simultaneously performing all node computations. 

Fig. 12B is a block diagram showing an example of the 
configuration of the LDPC code decoding apparatus for 
15 simultaneously performing all node computations. 

Fig. 12C is a block diagram showing an example of the 
configuration of the LDPC code decoding apparatus for 
simultaneously performing all node computations. 

Fig. 13 is a block diagram showing an example of the 
20 configuration of a check node calculator for simultaneously 
calculating messages. 

Fig. 14 is a block diagram showing an example of the 
configuration of a variable node calculator for 
simultaneously calculating messages. 
25 Fig. 15 shows an example of a check matrix of LDPC 
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codes . 

Fig. 16 shows a matrix in which a row permutation and a 
column permutation are performed on a check matrix. 

Fig. 17 shows a matrix, which is divided into 5x5 
5 units. 

Fig. 18A is a block diagram showing an example of the 
configuration of an embodiment of a decoding apparatus to 
which the present invention is applied. 

Fig. 18B is a block diagram showing an example of the 
10 configuration of the embodiment of the decoding apparatus to 
which the present invention is applied. 

Fig. 18C is a block diagram showing an example of the 
configuration of the embodiment of the decoding apparatus to 
which the present invention is applied. 
15 Fig. 19 is a block diagram showing an example of the 

configuration of an embodiment of a computer to which the 
present invention is applied. 

Best Mode for Carrying Out the Invention 
20 Specific embodiments to which the present invention is 

applied will be described below in detail with reference to 
the drawings . 

Fig. 16 shows a matrix in which a row permutation of 
equation (8) and a column permutation of equation (9) are 
25 performed on a check matrix of Fig. 15. 
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Row permutation: 6x+y+l-th row — » 5y+x+l-th row 

... (8) 

Column permutation: 6s+t+61-th column -> 5t+s+61-th 
column ... (9) 

5 In equations (8) and (9), x, y, s, and t are each an 

integer in the range of 0 < x < 5, 0 < y < 6, 0 < s < 5, and 
0 < y < 6 . 

According to the row permutation of equation (8), a 
permutation is performed in such a manner that the 1st, 7th,, 

10 . 13th, 19th, and 25th rows which, when divided by 6, give a 
remainder of 1, is permuted in the 1st, 2nd, 3rd, 4th, and 
5th row, and the 2nd, 8th, 14th, 20th, and 26th rows which, 
when divided by 6, give a remainder of 2, are permuted in 
the 6th, 7th, 8th, 9th, and 10th rows. 

15 According to the column permutation of equation (9), 

with respect to the 61st and subsequent columns, a 
permutation is performed in such a manner that the 61st, 
67th, 73rd, 79th, and 85th columns which, when divided by 6, 
give a remainder of 1, are permuted in the 61st, 62nd, 63rd, 

20 64th, and 65th columns, and the 62nd, 68th, 74th, 80th, and 
86th column which, when divided by 6, give a remainder of 2, 
are permuted in the 66th, 67th, 68th, 69th, and 70th columns. 

The matrix obtained by performing row and column 
permutations on the check matrix of Fig. 15 in this manner 

25 is the check matrix of Fig. 16. 
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It is self-explanatory that a 0 vector is output when 
the check matrix of Fig. 16 (hereinafter referred to as a 
"permutation check matrix" where appropriate) is multiplied 
by that on which the same permutation as equation (9) is 
5 performed on the sequence of codewords of codes (original 

codes with no errors) represented by the check matrix of Fig. 
15 (hereinafter referred to as an "original check matrix" as 
appropriate) . That is, if the original check matrix is 
denoted as a matrix H, the permutation check matrix is 

10 denoted as a matrix H 1 , the sequence of the codewords of the 
original codes is denoted as a row vector c, and a row 
vector obtained by performing a column permutation of 
equation (9) on the row vector c is denoted as c', Hc T (the 
superscript T indicates a transpose) becomes a 0 vector from 

15 the property of the check matrix. Naturally, H'c ,T also 
becomes a 0 vector. 

Based on the above, the transformation check matrix of 
Fig. 16 is a check matrix of codes c ! in which the sequence 
c of the codewords of the original codes on which the column 

20 permutation of equation (9) is performed are codewords. 

Therefore, when data that is coded by the original code 
is received and decoded, even if the column permutation of 
equation (9) is performed on the received code sequence, the 
code sequence after the column permutation is decoded using 

25 the decoding apparatus based on the transformation check 
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matrix of Fig. 16, and the inverse permutation of the column 
permutation of equation (2) is performed on the sequence of 
the decoded result, there is no difference in the decoded 
result from the case in which the decoding apparatus based 
5 on the original codes is used. As a result, no 

deterioration of the performance is caused to occur. That 
is, the decoding apparatus based on the original check 
matrix of Fig. 15 can be realized by using the decoding 
apparatus based on the transformation check matrix of Fig. 
10 16 . 

Fig. 17 shows the transformation check matrix of Fig. 
16 in which intervals are provided in units of 5 x 5 
matrices . 

In Fig. 17, the check matrix (transformation check 
15 matrix) is represented by a combination of a 5 x 5 unit 

matrix, a matrix in which one or more Is, which are elements 
of the unit matrix, are substituted with 0 (hereinafter 
referred to as a "quasi-unit matrix" where appropriate) , a 
matrix in which the unit matrix or the quasi-unit matrix is 
20 cyclically shifted (hereinafter referred to as a "shift 

matrix" where appropriate) , a sum of two or more of the unit 
matrix, the quasi-unit matrix, the shift matrix (hereinafter 
referred to as a "sum matrix" where appropriate) , and a 5 x 
5 0 -matrix . 

25 It may be said that the check matrix of Fig. 17 is 



- 34 - 



formed of a 5 x 5 matrix, a quasi-unit matrix, a shift 
matrix, a sum matrix, and a 0 -matrix. Therefore, these 5 x 
5 matrices, which form the check matrix, will be hereinafter 
referred to as "formation matrices" where appropriate. 
5 For decoding codes represented by the check matrix 

represented by such P x P formation matrices, architecture 
for simultaneously performing p calculations of check nodes 
and variable nodes can be used. 

Figs. 18A to 18C are block diagrams showing an example 

10 of the configuration of an embodiment of such a decoding 

apparatus. That is, Figs. 18A to 18C show an example of the 
decoding apparatus for performing decoding of LDPC codes by 
using a transformation check matrix of Fig. 17, obtained by 
performing a row or column permutation on the original check 

15 matrix of Fig. 15. 

Fig. 18A shows the overall configuration of the 
decoding apparatus. Fig. 18B shows the detailed 
configuration of the left portion in the figure surrounded 
by the dotted line B, of the decoding apparatus of Fig. 18A. 

20 Fig. 18C shows the detailed configuration of the right 

portion in the figure surrounded by the dotted line C, of 
the decoding apparatus of Fig. 18A. 

The decoding apparatus includes a memory 3 00 for 
storing edge data, which is made up of six FIFOs 300 x to 300 6 , 

25 a selector 301 for selecting the FIFOs 300i to 300 6 , a check 
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node calculator 302, two cyclic shift circuits 303 and 308, 
a memory 304 for storing edge data, which is made up of 18 
FIFOs 304i to 304i 8 , a selector 305 for selecting the FIFOs 
304i to 304i 8 , a received data memory 306 for storing 
5 received information, a variable node calculator 307, a 
decoded word calculation section 3 09, a received data 
rearranging section 310, and a decoded data rearranging 
section 311. 

Before describing in detail each section of the 
10 decoding apparatus, the method of storing data in the 

memories 300 and 304 for storing edge data will be described 
first . 

The memory 3 00 for storing edge data includes six FIFOs 
3 0 0 X to 3 0 0 6 , the number being such that the number of rows, 

15 30, of the transformation check matrix of Fig. 17 is divided 
by the number of rows, 5. The FIFO 300 y (y = 1, 2, . . ., 6) is 
formed in such a manner that messages corresponding to five 
edges, which is the number of the rows and columns of the 
formation matrix, can be read or written simultaneously. 

2 0 The length (the number of stages) thereof is 9, which is the 
maximum number of the number of Is (Hamming weight) in the 
row direction of the transformation check matrix of Fig. 17. 

In the FIFO 300i, the data corresponding to the 
positions of Is from the first row up to the fifth row of 

25 the check matrix (transformation check matrix) of Fig. 17 is 
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stored in such a manner that Is are padded closer (in a 
manner in which Os are ignored) in the horizontal direction 
for each row. That is, if the j-th row and the i-th column 
is denoted as (j, i) , in the first element (the first stage) 
5 of the FIFO 300!, data corresponding to the positions of Is 
of the 5x5 unit matrix from (1, 1) to (5, 5) of the check 
matrix is stored. In the second element, the data 
corresponding to the positions of Is of the shift matrix 
(shift matrix in which the 5x5 unit matrix is cyclically 

10 shifted by three to the right) from (1, 21) to (5, 25) of 
the check matrix is stored. Also, in the third to eighth 
elements, similarly, data is stored in such a manner as to 
correspond to the check matrix. In the nine element, data 
corresponding to the positions of Is of the shift matrix 

15 (shift matrix in which Is of the first row within the 5x5 
unit matrix are substituted with 0 and is cyclically shifted 
by one to the left) from (1, 86) to (5, 90) of the check 
matrix is stored. Here, in the shift matrix from (1, 86) to 
(5, 90) of the check matrix, since there are no Is in the 

20 first row, the number of elements becomes 8 only for the 
first row of the FIFO 300i, and the number of elements 
becomes 9 for the remaining rows. 

In the FIFO 3 00 2 , the data corresponding to the 
positions of Is from the sixth row up to the tenth row of 

25 the check matrix of Fig. 17 is stored. That is, in the 
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first element of the FIFO 300 2 , data corresponding to the 
positions of Is of the first shift matrix forming the sum 
matrix from (6, 1) to (10, 5) of the check matrix (the sum 
matrix which is the sum of the first shift matrix in which 
5 the 5x5 unit matrix is cyclically shifted by one to the 
right and the second shift matrix in which the 5x5 unit 
matrix is cyclically shifted by two to the right) is stored. 
In the second element, data corresponding to the positions 
of Is of the second shift matrix forming the sum matrix from 

10 (6, 1) to (10, 5) of the check matrix is stored. 

More specifically, for the formation matrix whose 
weight is 2 or more, the data (the message corresponding to 
the edge belonging to the unit matrix, the sum matrix, or 
the shift matrix) corresponding to the positions of Is of 

15 the unit matrix, the quasi-unit matrix, or the shift matrix, 
whose weight is 1, when the formation matrix is represented 
in the form of the sum of two or more of the P x p unit 
matrix whose weight is 1, a quasi-unit matrix in which one 
or more Is, which are elements of the unit matrix, are 

20 substituted with 0, and a shift matrix in which the unit 
matrix or the quasi-unit matrix is cyclically shifted, is 
stored at the same address (the same FIFOs among the FIFO3 0 0i 
to 300 6 ) . 

Hereafter, for the third to the ninth elements, the 
25 data is stored in such a manner as to correspond to the 



- 38 - 



check matrix. The number of elements of the FIFO 300 2 is 9 
for all the rows. 

For the FIFOs 300 3 to 300 6 , similarly, data is stored in 
such a manner as to correspond to the check matrix, and the 
length of each of the FIFOs 300 3 to 300 6 is 9. 

The memory 3 04 for storing edge data is formed of 18 
FIFOs 304i to 304 18/ the number being such that the number 90 
of rows of the check matrix is divided by 5, which is the 
number of the rows of the formation matrix. The FIFO 304 x (x 
= 1, 2,..., 18) is formed in such a manner that messages 
corresponding to five edges, the number being the number of 
the rows and the number of the columns of the formation 
matrix, can be read or written simultaneously. 

In the FIFO 3 04x, data corresponding to the positions of 
is from the first column up to the fifth column of the check 
matrix of Fig. 17 is stored in such a manner as to be padded 
closer in the vertical direction for each column (in a 
manner in which 0s are ignored) . That is, in the first 
element (the first stage) of the FIFO 304 i# data 
corresponding to the positions of Is of the 5x5 unit 
matrix from (1, 1) up to (5, 5) of the check matrix is 
stored. In the second element, data corresponding to the 
positions of Is of the first shift matrix forming the sum 
matrix of (6, 1) to (10, 5) of the check matrix (the sum 
matrix, which is the sum of the first shift matrix in which 
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the 5x5 unit matrix is cyclically shifted by one to the 
right and the second shift matrix in which the 5x5 unit 
matrix is cyclically shifted by two to the right) is stored. 
In the third element, data corresponding to the positions of 
5 Is of the second shift matrix forming the sum matrix of (6, 
1) up to (10, 5) of the check matrix is stored. 

More specifically, for the formation matrix whose 
weight is 2 or more, the data (the message corresponding to 
the edge belonging to the unit matrix, the quasi-unit matrix, 

10 the sum matrix, or the shift matrix) corresponding to the 
positions of Is of the unit matrix, the quasi-unit matrix, 
or the shift matrix, whose weight is 1, when the formation 
matrix is represented in the form of the sum of two or more 
of the P x P unit matrix whose weight is 1, a quasi-unit 

15 matrix in which one or more Is, which are elements of the 
unit matrix, are substituted with 0, and a shift matrix in 
which the unit matrix or the quasi-unit matrix is cyclically 
shifted, is stored at the same address (the same FIFOs among 
the FIFOs 304! to 304 i8 ) . 

20 Hereafter, for the fourth and fifth elements, also, 

data is stored in such a manner as to correspond to the 
check matrix. The number of elements (the number of stages) 
of the FIFO 304i is 5, which is the maximum number of the 
number of Is (Hamming weight) in the row direction from the 

25 first column up to the fifth column of the check matrix. 



- 40 - 



Also, for the FIFOs 304 2 and 304 3/ similarly, data is 
stored in such a manner as to correspond to the check matrix, 
and each of the length (the number of stages) thereof is 5. 
Also, for the FIFOs 304 4 to 304i 2 , similarly, data is stored 
5 in such a manner as to correspond to the check matrix, and 
each of the lengths thereof is 3. Also, for the FIFOs 304i 3 
to 304i 8 , similarly, data is stored in such a manner as to 
correspond to the check matrix, and each of the lengths 
thereof is 2. However, since the first element of the FIFO 

10 304i 8 corresponds to (1, 86) up to (5, 90) of the check 

matrix, and there are no Is in the fifth column ((1, 90) up 
to (5, 90) of the check matrix), data is not stored. 

A description will now be given below in detail of the 
operation of each section of the decoding apparatus of the 

15 Figs. 18A to 18C. 

The memory 300 for storing edge data includes six FIFOs 
300! to 300 6 . The FIFO for storing data is selected from 
among the FIFOs 300 x to 300 6 in accordance with the 
information on which row of the check matrix the five pieces 

20 of the message data supplied from the cyclic shift circuit 
308 at a previous stage belongs to. In the selected FIFO, 
the five pieces of the message data D311 are collectively 
stored in sequence. When data is to be read, the memory 300 
for storing edge data sequentially reads five pieces of the 

25 message data D300 x from the FIFO 300i, and supplies them to 
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the selector 301 at the subsequent stage. After the reading 
of the message data from the FIFO 300i is completed, the 
memory 300 for storing edge data reads the message data in 
sequence also from the FIFOs 300 2 to 300 6 and supplies it to 
5 the selector 301. 

The selector 301 selects five pieces of the message 
data from the FIFO from which data has been currently read 
from among the FIFOs 300i to 300 6 in accordance with the 
select signal D301, and supplies them as message data D302 

10 to the check node calculator 302. 

The check node calculator 3 02 includes five check node 
calculators 302 x to 302 5 . The check node calculator 302 
performs a computation in accordance with equation (7) by 
using messages D302 (D302i to D302 5 ) supplied through the 

15 selector 301, and supplies the five messages D303 (D303i to 
D3 03 5 ) obtained as a result of the computation to the cyclic 
shift circuit 303. 

The check node calculators 302 ± to 302 5 are each 
configured the same as the check node calculator 101 shown 

20 in Fig. 10. 

The cyclic shift circuit 303 cyclically shifts the five 
messages D303 x to D303 5 calculated by the check node 
calculator 302 on the basis of the information (matrix data) 
D3 05 as to how many the unit matrix in which the 

25 corresponding edge is the origin in the check matrix is 
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cyclically shifted, and supplies the result as a message 
D3 04 to the memory 3 04 for storing edge data. 

The memory 3 04 for storing edge data includes 18 FIFOs 
304i to 304 18 . The FIFO for storing data is selected from 
5 among the FIFOs 3 04i to 3 04 18 in accordance with information 
D3 05 as to which row of the check matrix the five pieces of 
the message data D304 supplied from the cyclic shift circuit 
303 at a previous stage belongs to, and the five pieces of 
the message data D3 04 is stored collectively in sequence in 

10 the selected FIFO. Furthermore, when the data is to be read, 
the memory 3 04 for storing edge data sequentially reads five 
messages D306 x from the FIFO 304x and supplies it to the 
selector 305 at the subsequent stage. After the reading of 
the data from the FIFO 304i is completed, the memory 304 for 

15 storing edge data sequentially reads the message data from 

the FIFOs 304i to 304i 8 and supplies them to the selector 305. 

In accordance with a select signal D307, the selector 
305 selects five pieces of the message data from the FIFO 
from which data has been currently read from among the FIFOs 

20 304x to 304i 8 , and supplies them as message data D308 to the 
variable node calculator 3 07 and the decoded word 
calculation section 309. 

On the other hand, the received data rearranging 
section 310 rearranges the code sequence (received data) 

25 D313 of the LDPC codes, which is received through the 
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communication channel, by performing a column permutation of 
equation (9) , and supplies it as a code sequence D314 to the 
memory 3 06 for received data. The memory 3 06 for received 
data, which has calculated the reception LLR (log likelihood 
5 ratio) from the code sequence D314 supplied from the 

received data rearranging section 310, supplies five pieces 
of the calculated reception LLR, as data D309, to the 
variable node calculator 3 07 and the decoded word 
calculation section 309 . 

10 The variable node calculator 307 includes five variable 

node calculators 307i to 307 5 . The variable node calculator 
3 07 performs a computation in accordance with equation (1) 
by using messages D308 (D308i to D308 5 ) supplied through the 
selector 3 05 and five reception LLR D3 09 supplied from the 

15 memory 306 for received data, and supplies messages D310 

(D310 x to D310 5 ) obtained as a result of the computation to 
the cyclic shift circuit 308. 

Here, the variable node calculators 307 x to 307 5 are 
each configured the same as the variable node calculator 103 

20 of Fig. 11. 

The cyclic shift circuit 308 cyclically shifts messages 
D310i to D310 5 calculated by the variable node calculator 307 
on the basis of the information as to how many the unit 
matrix in which the corresponding edge is the origin in the 

25 check matrix is cyclically shifted, and supplies the result 
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as a message D311 to the memory 300 for storing edge data. 

As a result of the above operation being circulated 
once, one decoding of the LDPC codes can be performed. 
After the decoding apparatus of Figs. 18A to 18C decodes the 
5 LDPC codes a predetermined number of times, in the decoded 
word calculation section 309 and the decoded data 
rearranging section 311, a final decoded result is 
determined and output . 

More specifically, the decoded word calculation section 

10 309 includes five decoded word calculation sections 309 x to 
309 5 . At the final stage of a plurality of decodings, by 
using five messages D308 (D308i to D308 5 ) output from the 
selector 305 and five reception LLR D309 supplied from the 
memory 3 06 for received data, the decoded word calculation 

15 section 309 calculates the decoded result (decoded word) in 
accordance with equation (5) , and supplies the decoded data 
D315 obtained thereby to the decoded data rearranging 
section 311. 

The decoded data rearranging section 311 rearranges the 
20 order of the decoded data D315 by performing an inverse 

permutation of the column permutation of equation (9) on the 
decoded data D315 supplied from the decoded word calculation 
section 309, and outputs it as the final decoded result D316. 
For the portions in which edge data (messages 
2 5 corresponding to the edge) lacks, during storage in the 



- 45 - 



memory (when data is stored in the memories 300 and 304 for 
storing edge data) , no message is stored. During node 
computation (during the check node computation at the check 
node calculator 3 02 and during the variable node computation 
5 at the variable node calculator 307) , no computation is 
performed . 

If a barrel shifter is used for the cyclic shift 
circuits 303 and 308, a desired operation can be realized 
while reducing the circuit scale. 

10 In the above description, an FIFO is used to store edge 

data (the memories 300 and 304 for storing edge data are 
formed by FIFOs) , but a RAM may be used instead of an FIFO. 
In that case, the RAM requires a bit width and a number of 
words of the total number of edges/p, at which p pieces of 

15 edge information (messages corresponding to the edges) , can 
be simultaneously read. For writing into the RAM, at which 
position the data to be written is read when it is read next 
is determined, and the data is written at that position. 
For reading from the RAM, data is sequentially read from the 

20 beginning of the address. If the RAM is used in place of 
the FIFO, the selectors 301 and 305 are not necessary. 

When the physical bit width of the FIFO and the RAM is 
not sufficient, by providing the same control signal by 
using a plurality of RAMs, these can be logically assumed as 

2 5 one RAM. 
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In the above -described case, for the simplification of 
description, a case in which p is 5, that is, the number of 
the rows and the number of the columns of the formation 
matrix forming the check matrix is 5, is used as an example. 
5 However, the number p of the rows and columns of the 

formation matrix is not always necessary to be 5, and it can 
take a different value depending on the check matrix. For 
example, p may be 360 or 3 92. 

In this embodiment, although LDPC codes of a code 

10 length of 90 and a coding rate of 2/3 are used, the code 
length and the coding rate of the LDPC codes may be any 
value. For example, when the number p of the rows and 
columns of the formation matrix is 5, if the total number of 
edges is less than or equal to 5, LDPC codes of any code 

15 length and coding rate can be decoded by using the decoding 
apparatus of Figs. 18A to 18C by only changing the control 
signal . 

The decoding apparatus for particular LDPC codes, that 
satisfy conditions in which the number p of the rows and 

2 0 columns of the formation matrix is a predetermined value and 
the total number of edges is less than or equal to a 
particular value, is able to decode the LDPC codes of any 
desired coding rate at any desired code length, which 
satisfy the conditions. 

2 5 As described above, one of or both a row permutation 
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and a column permutation is performed on a check matrix 
(original check matrix) , so that the check matrix is 
transformed to a check matrix (transformation check matrix) 
that can be represented by a combination of a P x p unit 
5 matrix, a quasi -unit matrix in which one or more Is, which 
are elements of the unit matrix, are substituted with 0, a 
shift matrix in which the unit matrix or the quasi -unit 
matrix is cyclically shifted, a sum matrix, which is the sum 
of two or more of the unit matrix, the quasi-unit matrix, 

10 and the shift matrix, and a P x p 0-matrix, that is, a 

combination of formation matrices. Thus, for decoding LDPC 
codes, architecture for simultaneously performing p check 
node computations and p variable node computations can be 
adopted. As a result, by simultaneously performing p node 

15 computations, the operating frequency can be suppressed 
within a feasible range. Thus, while a large number of 
iterative decodings can be performed, it is possible to 
prevent simultaneous access to different addresses from 
occurring during writing to and reading from the memory 

20 (FIFO and RAM) . 

That is, by performing one of or both the above- 
described row permutation and column permutation of the 
check matrix, it is possible to simultaneously perform p 
check node computations and p variable node computations. 

25 Furthermore, by simultaneously performing p node 
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computations in this manner, the operating frequency can be 
suppressed within a feasible range. Thus, while a large 
number of iterative decodings can be performed, it is 
possible to prevent simultaneous access to different 
5 addresses from occurring during writing to and reading from 
the memory (FIFO and RAM) . 

When LDPC codes represented by the check matrix 
(transformation check matrix) of Fig. 17 are decoded, since 
it is possible to perform computations on 269 edges every 

10 five edges for each check node and each variable node. 

Therefore, for one decoding, the decoding apparatus needs to 
perform 269/5 x 2 = 108 clock operations. Therefore, in 
order to perform 50 decodings, while 90 pieces of code 
information are received, the decoding apparatus needs to 

15 perform 108 x 50 = 5400 clock operations, and thus, an 

operating frequency approximately 60 times as high as the 
receiving frequency may be used. Therefore, according to 
the decoding apparatus of Figs. 18A to 18C, when compared to 
the decoding apparatus of Fig. 9 for performing each node 

20 computation one-by-one, only an operating frequency of 1/5 
is necessary. When viewed from the aspect of the circuit 
scale, since the size of the memory is the same, even if the 
logic circuit becomes large slightly, the influence on the 
whole is small. 

25 In general, since the code length of LDPC codes is as 
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large as several thousands to several tens of thousands, a 
matrix having a size whose value of p is several hundreds is 
used. In that case, the advantages that the decoding 
apparatus of the present invention is used are larger. 
5 As described above, while the circuit scale for both 

logic and memory is suppressed, the operating frequency can 
be suppressed within a feasible range, and control of memory 
access can be performed easily. 

From the properties of LDPC codes, even if a row 

10 permutation and a column permutation are performed on the 
check matrix, the performance of codes does not change. 
Therefore, for LDPC codes corresponding to the check matrix 
from which a transformation check matrix that can be 
represented by a combination of formation matrices is 

15 obtained by a row permutation and a column permutation, even 
if the LDPC codes have any code length and any coding length, 
the deterioration of performance is not caused to occur, the 
implementation is easy, and decoding can be performed with 
high efficiency. 

2 0 Furthermore, since the decoding apparatus according to 

the present invention faithfully implements the sum product 
algorithm, decoding loss other than quantization of messages 
does not occur. 

From the above viewpoints, by using the decoding 

25 apparatus according to the present invention, high- 
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performance decoding becomes possible. 

When the check matrix is not a multiple of the number p 
of the rows and columns of the formation matrix, the present 
invention may be applied by assuming the check matrix to be 
5 a multiple of p by assigning elements of all 0s to the 
outside of the fractions of the check matrix. 

Next, the above-described series of processes can be 
performed by hardware, and it can also be executed by 
software. In a case where the series of processes is 
10 performed by software, the program forming the software is 
installed into a general -purpose computer, etc. 

Accordingly, Fig. 19 shows an example of the 
configuration of an embodiment of a computer to which the 
program for executing the above-described series of 
15 processes is installed. 

The program may be recorded in advance in a hard disk 
4 05 and a ROM 4 03 serving as a recording medium incorporated 
in the computer. 

Alternatively, the program can be temporarily or 
20 permanently stored (recorded) in a removable recording 

medium 411, such as a flexible disk, a CD-ROM (Compact Disc 
Read-Only Memory) , an MO (Magneto optical) disk, a DVD 
(Digital Versatile Disc) , a magnetic disk, or a 
semiconductor memory. Such a removable recording medium 411 
25 can be provided as so-called packaged software. 
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In addition to being installed into a computer from the 
removable recording medium 411 such as that described above, 
programs can be transferred in a wireless manner from a 
download site via a man-made satellite for digital satellite 
5 broadcasting or can be transferred by wire to a computer via 
a network, such as a LAN (Local Area Network) or the Interne. 
In the computer, the programs which are transferred in such 
a manner can be received at a communication section 408, can 
be installed into the hard disk 405 contained therein. 

10 The computer incorporates a CPU (Central Processing 

Unit) 402. An input/output interface 410 is connected to 
the CPU 402 via a bus 401. When an instruction is input by 
the user by operating an input section 407 including a 
keyboard, a mouse, a microphone, etc., via the input/output 

15 interface 410, the CPU 402 executes the program stored in 
the ROM (Read Only Memory) 4 03 in accordance with that 
instruction. Alternatively, the CPU 402 loads, into the RAM 
(Random Access Memory) 404, the program stored in the hard 
disk 405, the program that is transferred from a satellite 

20 or a network, that is received by the communication section 
408, and that is installed into the hard disk 405, or the 
program that is read from the removable recording medium 411 
loaded to a drive 409 and is installed into the hard disk 
405, and executes the program. As a result, the CPU 402 

25 performs processing in accordance with the above -described 



- 52 - 



flowcharts or processing according to the above -described 
block diagrams. Then, for example, the CPU 402 outputs the 
processing result via the input/output interface 410 from an 
output section 406 including an LCD (Liquid-Crystal Display) , 
5 a speaker, etc., or transmits it from the communication 

section 408, and further records it in the hard disk 405 as 
required. 

In this specification, processing steps for writing the 
program for enabling the computer to perform various 

10 processing need not to be executed chronologically according 
to the written orders as flowcharts. Also, they may be 
executed concurrently or individually (for example, parallel 
processing or object-based processing) . 

The program may be processed by one computer or may be 

15 processed by a plurality of computers in a distributed 

manner. Furthermore, the program may be transferred to a 
distant computer and may be processed thereby. 

Industrial Applicability 
20 As has thus been described, according to the present 

invention, while the circuit scale is suppressed, the 
operating frequency can be suppressed within a sufficiently 
feasible range, and control of memory access can be 
performed easily. 



25 



